Census Data Mining – An Application

نویسندگان

  • Willi Klösgen
  • Michael May
چکیده

Because of data privacy regulations, census data are available for analysis only in aggregated form. Primary data (responses of persons) are aggregated in many cross tabulations for small geographical units. Thus the target objects of secondary analysis are small areas (enumeration districts or wards ). Any cell or marginal of a cross tabulation can be used as variable on these target objects. The target objects can be linked with other spatial objects (e.g. rivers, roads, railway lines) for spatial analyses. In this paper we discuss the special requirements that occur for this type of aggregate data mining including spatial analyses. We show an application of SubgroupMiner, which is an advanced subgroup mining system supporting multirelational hypotheses, efficient data base integration, discovery of causal subgroup structures, and visualization based interaction options.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pixel Based Visual Mining of Geo-Spatial Data

In many application domains, data is collected and referenced by geo-spatial location. Spatial data mining, or the discovery of interesting patterns in such databases, is an important capability in the development of database systems. A noteworthy trend is the increasing size of data sets in common use, such as records of business transactions, environmental data and census demographics. These ...

متن کامل

Discovery of spatial association rules in geo-referenced census data: A relational mining approach

Census data mining has great potential both in business development and in good public policy, but still must be solved in this field a number of research issues. In this paper, problems related to the geo-referenciation of census data are considered. In particular, the accommodation of the spatial dimension in census data mining is investigated for the task of discovering spatial association r...

متن کامل

A study on the application of data mining to disadvantaged social classes in Taiwan's population census

Data mining has been widely applied to different areas. For a country with a huge population and household census data, data mining is an ideal approach for analyzing this information. In Taiwan single-parent families, aborigines and the elderly have long been considered disadvantaged social classes, and their widening problems will have a tremendous impact and influence on society. This study ...

متن کامل

Mining spatial association rules in census data

In this paper we propose a method for the discovery of spatial association rules, that is, association rules involving spatial relations among (spatial) objects. The method is based on a multi-relational data mining approach and takes advantage of the representation and reasoning techniques developed in the field of inductive logic programming (ILP). In particular, the expressive power of predi...

متن کامل

Visual Data Mining of Large Spatial Data Sets

Abstract. Extraction of interesting knowledge from large spatial databases is an important task in the development of spatial database systems. Spatial data mining is the branch of data mining that deals with spatial (location) data. Analyzing the huge amount (usually terabytes) of spatial data obtained from large databases such as credit card payments, telephone calls, environmental records, c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002